Efficient general lattice generation and rescoring

نویسندگان

  • Andrej Ljolje
  • Fernando Pereira
  • Michael Riley
چکیده

We describe a lattice generation method that produces highquality lattices with less than 10% increased computation over standard Viterbi decoding. Using the North American Business News (NAB) task, we show our method is within 0.2% in lattice word-error rate of ‘full lattices’, which are those that contain all the recognition hypotheses within the search beam. Our method is closely related to previous lattice generation methods, but applies to more general network topologies. We also give real-time results on the NAB task, in which we generate lattices in a first pass and then rescore them with stronger acoustic and language models in a second pass. We are able to achieve at 3x real-time a word error rate of 11.2% on the Eval ’95 test set, which is only 1.7% worse than AT&T’s official benchmark result that year using what was then a 1000x real-time system.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lattice rescoring methods for statistical machine translation

Modern statistical machine translation (SMT) systems include multiple interrelated components, statistical models, and processes. Translation is often factored as a cascaded series of modules such that the output of one module serves as the input to the next; this is the SMT pipeline. Simplifying assumptions, limited training data, and pruning during search mean that the hypothesis produced by ...

متن کامل

On-the-fly lattice rescoring for real-time automatic speech recognition

This paper presents a method for rescoring the speech recognition lattices on-the-fly to increase the word accuracy while preserving low latency of a real-time speech recognition system. In large vocabulary speech recognition systems, pruned and/or lower order n-gram language models are often used in the first-pass of the speech decoder due to the computational complexity. The output word latti...

متن کامل

Square Lattice Elliptical- Core Photonic Crystal Fiber Soliton-Effect Compressor at 1550nm

 In this paper, we investigate the evolution of supercontinuum and femtosecond optical pulses generation through square lattice elliptical-core photonic crystal fiber (PCF) at 1550 nm by using both full-vector multipole method (M.P.M) and novel concrete algorithms: symmetric  split-step Fourier (SSF) and  fourth order Runge Kutta (RK4) which is an accurate method to solve the general  nonlinear...

متن کامل

Lattice Rescoring for Speech Recognition using Large Scale Distributed Language Models

In this paper, we suggest a lattice rescoring architecture that has features of a Trie DB based language model (LM) server and a naïve parameter estimation (NPE) to integrate distributed language models. The Trie DB LM server supports an efficient computation of LM score to rerank the n-best sentences extracted from the lattice. In the case of NPE, it has a role of an integration of heterogeneo...

متن کامل

Neural Machine Translation by Minimising the Bayes-risk with Respect to Syntactic Translation Lattices

We present a novel scheme to combine neural machine translation (NMT) with traditional statistical machine translation (SMT). Our approach borrows ideas from linearised lattice minimum Bayes-risk decoding for SMT. The NMT score is combined with the Bayes-risk of the translation according the SMT lattice. This makes our approach much more flexible than n-best list or lattice rescoring as the neu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999